(54) Hepatocyte growth factor activator inhibitor

(57) A novel protein having inhibitory activity on the protease activity of hepatocyte growth factor (HGF) activator was purified and isolated, and its molecular weight (ca. 30,000 daltons) and partial amino acid sequence were determined. A gene coding for the protein was cloned, and the gene DNA was incorporated into a vector for transforming host cells. Cultivation of the transformant gave the desired protein. The protein can be used as an in vitro or in vivo control factor for HGF or HGF activator. It is also useful as an antigen for producing an antibody to be used as a means for kinetic studies of the protein, or as a standard in assay systems therefor.
Description

FIELD OF THE INVENTION 

The present invention relates to a novel protein and a DNA coding for the same. More particularly, it relates to a novel protein having inhibitory activity on the protease activity of hepatocyte growth factor activating factor (HGF activator) (hereinafter this protein is sometimes referred to also as "HAI-II"), a gene coding for the protein, an expression vector containing the gene, a transformant transformed with the expression vector, and a method of producing HAI-II using the transformant.


BACKGROUND OF THE INVENTION 

It has already been reported that thrombin activates the precursor of HGF activator (JP-A-5-103670, JP-A-6-141859, JP-A-6-153946 and JP-A-6-153966 (the term "JP-A" as used herein means an "unexamined published Japanese patent application"); a factor having activity to convert the single chain form of hepatocyte growth factor (HGF) to its double chain form) in the manner of positive activity control. However, no human tissue-derived protease inhibitor capable of inhibiting, as a negative control factor, the physiological activity of HGF activator has been known. Therefore, how HGF activator is controlled in human tissues remains unknown. Such a negative control factor might also indirectly influence the activity of hepatocyte growth factor (HGF), on which HGF activator acts. Thus, for the analysis of the mechanism of action of HGF in vivo as well, it has been demanded that such a human tissue-derived protease inhibitor be isolated and identified.

By using such a protease inhibitor and an antibody to the protease inhibitor, it would become possible to know the in vivo physiological activity of HGF activator, to analyze the mechanism of action thereof or to analyze the mechanism of control of HGF activation, from a standpoint different from those of the prior art.

Furthermore, for investigating the detailed in vivo function of HAI-II or the effect of HAI-II in hepatic disorder, for instance, HAI-II is required in large quantities. At present, however, there is only one method available for preparing HAI-II, which method comprises using, as a starting material, the culture supernatant obtained with a human cancer cell line such as MKN45 or A549 cells and purifying therefrom HAI-II occurring therein in trace amounts. This method is not always the best one from the viewpoint of labor, time and cost. It encounters great difficulties in stably isolating the minor amount of HAI-II alone. Therefore, it has been desired that an expression system be constructed so that HAI-II can be obtained stably and in large quantities.

SUMMARY OF THE INVENTION 

The present inventors have conducted screening of various cultured cell lines using, as an indicator, the inhibitory activity on the protease activity of hepatocyte growth factor activator and have found that a substance having the activity occurs in the culture supernatant of certain human cancer cell lines (MKN45 cells, A549 cells and like epithelial tumor cell lines). To reveal the nature of its inhibitory activity, they further attempted to purify the substance from the MKN45 cell culture supernatant using various column chromatography techniques. As a result, they have found a novel protein with a molecular weight of about 30,000 daltons as determined by SDS (sodium dodecyl sulfate)-polyacrylamide gel electrophoresis (PAGE), and they have also obtained an amino-terminal amino acid sequence of this protein by analyzing the protein on a protein sequencer. Further, they determined partial amino acid sequences by decomposing the protein using proteolytic enzymes, isolating the resultant peptides and subjecting each peptide to the same amino acid sequence analysis as mentioned above. Furthermore, they estimated DNA base sequences based on the partial amino acid sequences and conducted screening of a cDNA library using oligonucleotide probes prepared based on the sequences. As a result, they have succeeded in cloning a gene coding for the protein and have now completed the present invention.

Furthermore, as a result of various investigations to produce the protein stably and in large quantities using the recombinant DNA technique, the present inventors have constructed a novel expression vector coding for the protein and have enabled expression of the protein. Thus, by constructing a plasmid for protein expression by inserting a DNA fragment coding for part or the whole of the amino acid sequence of the protein into a plasmid vector, such as the expression vector pME18S for use in animal cells or an expression vector for use in yeasts, Escherichia coli and the like, at a site downstream from the promoter thereof, and using the thus-obtained recombinant plasmid to transform host cells, they have now completed the present invention in another aspect.

The present invention thus relates to a protein having the following physico-chemical properties:

(1) a molecular weight of about 30,000 daltons as determined by SDS-polyacrylamide gel electrophoresis;

(2) inhibitory activity on the protease activity of hepatocyte growth factor activator; and

(3) one of the amino acid sequences depicted in the sequence listing under SEQ ID NO:1 through 3 or an amino






EP0 758 682 A2 



acid sequence substantially equivalent thereto; proteins respectively having the amino acid sequences depicted in the sequence listing under SEQ ID NO:1 through 3 or amino acid sequences substantially equivalent thereto and having inhibitory activity on the protease activity of hepatocyte growth factor activator; a protein having the amino acid sequence depicted in the sequence listing under SEQ ID NO:4 or an amino acid sequence substantially equivalent thereto; and a protein having, as its amino acid sequence, that segment of the amino acid sequence depicted in the sequence listing under SEQ ID NO:4 which starts with the 28th amino acid (alanine) residue and ends with the 252nd amino acid (leucine) residue, or an amino acid sequence substantially equivalent thereto; DNAs and genes coding for the proteins defined above; expression vectors respectively containing the DNAs or genes; transformants obtained by transformation of host cells with the expression vectors; as well as a method of producing proteins having inhibitory activity on the protease activity of hepatocyte growth factor activator which comprises cultivating the transformants.

The base sequence shown in the sequence listing under SEQ ID NO:4 contains only one strand, with the other complementary base sequence being omitted. Starting with this gene and using the recombinant DNA technology, it is possible to cause expression of, for example, the protein having the amino acid sequence shown in the sequence listing under SEQ ID NO:4. On that occasion, the protein translated from mRNA coding for the protein contains a signal sequence. After extracellular excretion, however, the signal sequence has been cleaved off and the protein obtained has an amino acid sequence comprising the 28th amino acid (alanine) residue and the subsequent amino acid residues of the amino acid sequence shown in the sequence listing under SEQ ID NO:4. Signal sequences of other proteins may also be used as the signal sequence. For signal sequence-free mature protein expression in host cells, a gene having that portion of the base sequence shown in the sequence listing under SEQ ID NO:4 which comprises the 82nd nucleotide (guanine) residue and the subsequent nucleotide residues may be used as the gene coding for the relevant protein and joined to the ATG codon of a vector. The present invention further includes, within the scope thereof, modifications of the proteins or DNAs mentioned above as derived therefrom by deletion, substitution and/or addition of one or more amino acid or nucleotide residues within limits not harmful to the inhibitory activity on the protease activity of HGF activator, namely those proteins or DNAs that respectively have "substantially equivalent amino acid sequences" or "substantially equivalent base sequences".
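The residue and nucleotide positions quoted above agree with simple codon arithmetic. The following is a minimal Python sketch, assuming 1-based numbering and assuming that nucleotide 1 of SEQ ID NO:4 is the first base of the initiation codon (the sequence listing itself is not reproduced here, so this is an assumption; the function names are illustrative only). Under that assumption the codon of the 28th residue begins at nucleotide 82, which fits the statement that this nucleotide is guanine, since all alanine codons (GCN) begin with G.

```python
def first_nucleotide_of_residue(residue_index: int, cds_start: int = 1) -> int:
    """1-based position of the first base of the codon for a given residue.

    cds_start is the position of the first base of the initiation codon
    (assumed to be 1 here; this is not confirmed by the text).
    """
    if residue_index < 1:
        raise ValueError("residue_index must be >= 1")
    return cds_start + 3 * (residue_index - 1)


def residue_of_nucleotide(nucleotide_index: int, cds_start: int = 1) -> int:
    """1-based residue number whose codon contains the given nucleotide."""
    if nucleotide_index < cds_start:
        raise ValueError("nucleotide lies upstream of the coding region")
    return (nucleotide_index - cds_start) // 3 + 1


# Mature protein: residues 28 (alanine) through 252 (leucine).
mature_first_nt = first_nucleotide_of_residue(28)   # 82
mature_length_aa = 252 - 28 + 1                     # 225 residues

print(mature_first_nt, mature_length_aa)
```

Shifting `cds_start` would adapt the same arithmetic to a sequence that carries untranslated bases ahead of the ATG.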

BRIEF DESCRIPTION OF THE DRAWINGS 


Fig. 1 shows the results of assaying of the protein of the present application for its inhibitory activity on the protease 
activity of HGF activator. 

Fig. 2 shows the structure of the plasmid pME18S-HAI-II.

DETAILED DESCRIPTION OF THE INVENTION

In the following, the present invention is described in further detail. The novel protein of the present invention which has protease inhibitor activity can be obtained by proceeding via such purification steps as mentioned below. For example, a human cancer cell line (MKN45 cells or A549 cells deposited with the Japanese Cancer Research Resources Bank under the deposit numbers JCRB0254 and JCRB0076, respectively, or a like epithelial tumor cell line) is cultivated in a serum-free medium for several days, the culture supernatant is recovered and, after removal of cells therefrom and concentration, submitted to a heparin-Sepharose column (available e.g. from Pharmacia). The non-adsorbed fraction is submitted to a ConA-Sepharose column (available e.g. from Pharmacia) and separated into an adsorbed fraction and a non-adsorbed fraction. The non-adsorbed fraction is subjected to hydrophobic chromatography using Phenyl-5PW (available e.g. from Tosoh Corp.). The thus-obtained fraction containing the desired protein is chromatographed on a DEAE ion exchange column (available e.g. from Polymer Laboratory), then submitted to a hydroxyapatite column (available e.g. from Mitsui Toatsu Chemicals or Seikagaku Corp.), and further to gel filtration column chromatography (using e.g. Asahi Chemical Industry's GS520) to give the protein in question. The purification steps may further include reversed phase column chromatography and/or other appropriate means, as necessary.

Upon SDS-polyacrylamide gel electrophoresis, the thus-purified protein of the present invention migrates as a smear band or several fragments presumably resulting from differences in sugar chain, amino acid residue modification and/or C-terminal side mutation and having a molecular weight of about 30,000 daltons. When reacted with HGF activator, the protein shows inhibitory activity on the protease activity of HGF activator. This protein of the present invention contains the amino acid sequences shown in Table 1 below.

A DNA fragment of the gene coding for the novel protein of the present invention can be obtained in the following manner. By analyzing the novel protein purified in the above manner using a gaseous phase protein sequencer (available e.g. from Applied Biosystems), its amino-terminal amino acid sequence can be determined. Further, the protein is decomposed using lysyl endopeptidase (e.g. Achromobacter protease I), the resulting peptide fragments are separated by reversed phase high-performance liquid chromatography (using e.g. a YMC column) and each fragment is subjected to amino acid sequence analysis in the same manner as mentioned above, whereby the amino acid sequence of an intermediate portion of the protein can be revealed.

A DNA base sequence is deduced from the amino acid sequence thus determined and, correspondingly, appropriate oligonucleotides are synthesized and used as probes. Human-derived liver, spleen and placenta cDNA libraries (available e.g. from Clontech), among others, can be used as the cDNA library for screening out the gene coding for the desired protein. In addition, a cDNA library may be constructed in the conventional manner from a cell line or tissue material in which the protein is expressed.

Escherichia coli is transfected with λ phage containing such cDNA incorporated therein (the method of Maniatis et al.: "Molecular Cloning") and then cultivated. The plaques formed are subjected to selection by plaque hybridization using oligonucleotide probes prepared based on the base sequence deduced from the amino acid sequence of a portion of the protein in question, whereby a certain number of different λ phage clones coding for the amino acid sequence of the desired protein and containing, in addition, those segment base sequences of the protein that correspond to regions other than the probes can be obtained with ease.

The phage from each positive plaque obtained in the above screening is then allowed to replicate by the method of Maniatis et al., and DNA is purified therefrom by the glycerol gradient method and, after appropriate restriction enzyme cleavage, submitted to cDNA subcloning into a plasmid vector such as pUC18 or pUC19 or a single chain phage vector such as M13mp18 or M13mp19. Thereafter, the base sequence of the desired cDNA fragment can be determined by the method of Sanger et al. The base sequences of the clones obtained are analyzed and combined and, as a result, a gene totally corresponding to the whole amino acid sequence of the desired protein as shown in the sequence listing under SEQ ID NO:4 can be derived from a group of cDNAs coding for respective portions of the protein. It is also possible to obtain a gene containing the whole of the cDNA in question, a gene containing the cDNA with deletion of a partial base sequence thereof, a gene containing the cDNA with insertion of some other base sequence, a gene containing the cDNA with substitution of some other base sequence for a partial base sequence of the cDNA, or the like gene from a variety of cDNA libraries by the PCR method using portions of the cDNA in question as probes. Such site-specific mutation, inclusive of base sequence deletion, addition or substitution, can be readily realized by the methods described in the literature (e.g. Methods in Enzymol., 217, 218 (1993); 217, 270 (1993)).

The group of cDNAs obtained in the above manner are joined together so that the order of the base sequences fits the amino acid sequence of the protein, to give a DNA fragment covering the whole region of the protein. The DNA fragment is inserted into a plasmid, such as pCDL-SRα296, at a site downstream from the promoter thereof and matched in phase with the translation initiation codon ATG, to thereby construct a protein expression vector. Then, the protein can be expressed in a host, for example animal cells, transformed with the plasmid. Thereafter, the protein expressed can be recovered by purification by a conventional method.

Thus, each of the thus-obtained cDNAs is inserted into a plasmid, such as pME18S, at a site downstream from the promoter thereof to thereby construct a plasmid for protein expression. The protein or a protein derived therefrom by partial amino acid sequence deletion, insertion or substitution can be expressed in a host, such as animal cells, transformed with the expression plasmid. More concretely, CHO cells, COS cells, mouse L cells, mouse C127 cells, mouse FM3A cells and the like can be used as the animal cells for protein expression. When these animal cells are used as the host, the use, as a signal sequence, of that portion of the DNA base sequence shown in the sequence listing under SEQ ID NO:4, namely the gene for the protein, which starts with the 1st nucleotide and ends with the 35th nucleotide, or the use of an existing signal sequence is expected to be conducive to extracellular secretory production of the protein or production thereof on the cell membrane.

The expression plasmid for use in animal cell hosts is constructed in the following manner. As the promoter, use can be made of any of the existing promoters, for example the SRα promoter, SV40 promoter or metallothionein gene promoter. A DNA containing the whole gene for the protein, inclusive of the above-mentioned signal-like sequence, a DNA containing the gene with a partial base sequence deletion, a DNA containing the gene with insertion of a base sequence or a DNA containing the gene with substitution of some other sequence for a partial base sequence thereof is inserted into a site downstream from the promoter in the direction of transcription. In constructing the expression plasmid for the protein, two or three pieces of the DNA fragment of the gene coding for the protein may be joined together and used for insertion downstream from the promoter. It is also possible to join such a promoter as the SV40 promoter to the 5' upstream side of the DNA fragment of the gene coding for the protein to give a unit insert and to insert, into a vector, two or three such units joined together in the same direction of transcription. A polyadenylation site is added to the downstream side of the gene coding for the protein. For example, the polyadenylation site derived from the SV40 DNA, β-globin gene or metallothionein gene can be joined to the downstream side of the gene coding for the protein. When a DNA fragment comprising a promoter and the gene coding for the protein as joined together is duplicated or triplicated, each unit may contain a polyadenylation site on the 3' side of the gene coding for the protein. In
transforming animal cells, for example CHO cells, with such an expression vector, a drug resistance gene can be used for the purpose of selecting expressing cells. As the drug resistance gene, there may be mentioned the DHFR gene which provides resistance to methotrexate (J. Mol. Biol., 159, 601 (1982)), the Neo gene which provides resistance to the antibiotic G-418 (J. Mol. Appl. Genet., 1, 327 (1982)), the Escherichia coli-derived Ecogpt gene which provides resistance to mycophenolic acid (Proc. Natl. Acad. Sci. U.S.A., 78, 2072 (1981)) and the hph gene which provides resistance to the antibiotic hygromycin (Mol. Cell. Biol., 5, 410 (1985)), among others. Each resistance gene contains a promoter, such as the above-mentioned SV40-derived promoter, inserted on the 5' upstream side and a polyadenylation site as mentioned above on the 3' downstream side of each resistance gene. In inserting such a resistance gene into the expression vector for the protein, the gene may be inserted at a site downstream from the polyadenylation site of the gene coding for the protein, in either direction, the same or opposite. These expression vectors make it unnecessary to perform double transformation with another plasmid containing a selective marker gene for the purpose of transformant isolation. When the expression vector for the protein does not contain such a selective marker gene insert, a vector having a marker suited for transformant selection, for example pSV2neo (J. Mol. Appl. Genet., 1, 327 (1982)), pMBG (Nature, 224, 228 (1981)), pSV2gpt (Proc. Natl. Acad. Sci. U.S.A., 78, 2072 (1981)) or pAd-D26-1 (J. Mol. Biol., 159, 601 (1982)), may be used, in combination with the expression vector for the gene coding for the protein, for transformation to thereby make it easy to perform transformant selection based on phenotypic expression of the drug resistance gene.

The expression vector can be introduced into animal cells by the calcium phosphate method (Virology, 52, 456 (1973)) or the electroporation method (J. Membr. Biol., 10, 279 (1972)), for instance. The transformed animal cells can be cultivated in the conventional manner by suspension culture or adhesion culture. They are cultivated in a medium such as MEM or RPMI 1640 in the presence of 5 to 10% of serum or in the presence of an appropriate amount of insulin, transferrin or the like, or under serum-free conditions. Further, it is also possible to produce the protein using microorganisms such as yeasts or Escherichia coli, for example strains of Saccharomyces cerevisiae or the strain Escherichia coli YA-21. Since the cells express the protein in the culture supernatant or on the cell surface, it is possible to recover and purify the protein using the culture supernatant or cells of this transformant. More specifically, the protein can be isolated and purified by subjecting the culture supernatant or cell extract containing the protein produced to an appropriate chromatography procedure, for example chromatographic treatment using heparin-Sepharose, ConA-Sepharose, hydroxyapatite and the like in combination.

The protease inhibitor activity-endowed protein of the present invention has inhibitory activity on the protease activity of HGF activator and, therefore, is useful as an in vitro or in vivo regulatory factor for HGF activator or, indirectly, as an HGF activity regulating factor. The protein as well as an antibody to the protein or a gene coding for the protein is further useful as a tool or means for function analysis of the factors.

Furthermore, by introducing an expression vector carrying a gene coding for the protein into animal cells, it becomes possible to produce part or the whole of the protein or a protein equivalent thereto, which is biologically active, in a stable manner and in large quantities. This has so far been difficult to attain.

The present invention is now illustrated in greater detail with reference to the following Examples. However, it is not 
intended that the present invention be limited to these Examples. 

EXAMPLE 1 


(Purification of the protein using an MKN45 cell culture supernatant) 

MKN45 cells (Naito et al., Gan to Kagaku-ryoho (Cancer and Chemotherapy), 5, 89 (1978)) (obtained from Meneki Seibutsu Kenkyusho) were seeded into eRDF medium containing 5% FBS (fetal bovine serum) as placed in a roller bottle 850 and allowed to proliferate until a confluent state was attained. Then, the FBS-containing culture supernatant was removed and the cells were washed with two portions of serum-free eRDF medium. After removing the washing medium, 500 ml of serum-free eRDF medium was added and incubation was carried out at 37°C for 3 to 6 days. After incubation, the culture supernatant was recovered. 500 ml of fresh serum-free eRDF medium was added, and incubation was again conducted. This procedure was repeated several times. The culture supernatants thus recovered were combined and concentrated about 20-fold using a YM30 ultrafiltration membrane (Amicon).

This concentrate was submitted to a heparin-Sepharose column (equilibrated with PBS) and the non-adsorbed fraction was recovered. This fraction was submitted to a ConA-Sepharose column (equilibrated with PBS) and separated into the non-adsorbed fraction and an adsorbed fraction eluted with a PBS solution containing 200 mM α-methyl-D-mannoside. The ConA non-adsorbed fraction was concentrated using YM30, followed by buffer substitution to 10 mM phosphate buffer (pH 6.8) containing 1 M ammonium sulfate. The new solution was subjected to HPLC using Phenyl-5PW (Tosoh Corp.; equilibrated with 10 mM phosphate buffer (pH 6.8) containing 1 M ammonium sulfate), followed by linear concentration gradient elution with 1 M ammonium sulfate to 0 M ammonium sulfate. A fraction containing the desired protease inhibitor activity was thus recovered.

The fraction was dialyzed against 20 mM Tris-hydrochloride buffer (pH 8) containing 0.05% CHAPS and then subjected to HPLC using DEAE (equilibrated with 20 mM Tris-hydrochloride buffer (pH 8) containing 0.05% CHAPS), followed by linear concentration gradient elution with 0 M to 500 mM NaCl, whereby a fraction showing the desired protease inhibitor activity was recovered. The fraction was dialyzed against 5 mM phosphate buffer (pH 6.8) containing 0.05% CHAPS and then subjected to HPLC using an HCA A-4007 column (product of Mitsui Toatsu Chemicals) (equilibrated with 5 mM phosphate buffer (pH 6.8) containing 0.05% CHAPS), and the non-adsorbed fraction was recovered. The fraction was submitted to GS-520 (equilibrated with PBS containing 0.05% CHAPS) and an active fraction (fraction of about 40 to 20 kDa) was recovered. For eliminating minor bands, the fraction was applied to a YMC pack C4 column (obtained from YMC), linear concentration gradient elution was carried out over 40 minutes using acetonitrile-isopropyl alcohol (3/7) containing 0.1% TFA and varying the concentration thereof from 10% to 50%, and the active fraction was neutralized with 1 M Tris-hydrochloride buffer (pH 8) and then dried under reduced pressure. After drying, the solid obtained was dissolved in PBS containing 0.05% CHAPS to give a purified protein solution.
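The linear concentration gradients used in this Example follow simple interpolation between the start and end concentrations. As a minimal Python sketch, using the reversed phase step as the illustration (10% to 50% organic solvent over 40 minutes, as stated above), the solvent concentration at a given time can be computed as follows. The function name and its defaults are illustrative assumptions and are not part of the original procedure; any dwell volume or hold steps of a real HPLC system are ignored.

```python
def linear_gradient(t_min: float,
                    start_pct: float = 10.0,
                    end_pct: float = 50.0,
                    duration_min: float = 40.0) -> float:
    """Solvent concentration (%) at time t_min of a linear gradient.

    Defaults correspond to the 10% -> 50% gradient over 40 minutes
    described for the YMC pack C4 step.
    """
    if duration_min <= 0:
        raise ValueError("duration_min must be positive")
    if not 0 <= t_min <= duration_min:
        raise ValueError("t_min must lie within the gradient")
    return start_pct + (end_pct - start_pct) * t_min / duration_min


# Slope of this gradient: (50 - 10) / 40 = 1% per minute.
print(linear_gradient(20.0))  # 30.0
```

The same function describes the salt gradients when called with the corresponding start and end values, for example `start_pct=0, end_pct=500` read as mM NaCl for the DEAE step, with a run time chosen by the operator.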

EXAMPLE 2 

(Amino-terminal amino acid sequence and partial amino acid sequence determination of the protein)

The protein having protease inhibitor activity as purified as in Example 1 and eluted by reversed phase HPLC was dried under reduced pressure without neutralization. This was dissolved in 60 µl of 50% TFA (trifluoroacetic acid), added to a polybrene-treated glass filter and subjected to Edman degradation on an Applied Biosystems model 470A sequencer, and the amino acid sequence of an N-terminal region was determined. Phenylthiohydantoin (PTH)-amino acids were identified using a Mitsubishi Chemical's MCI gel ODS IHU column (0.46 x 15 cm) and conducting single solvent elution with acetate buffer (10 mM acetate buffer (pH 4.7), 0.01% SDS, 38% acetonitrile) at a flow rate of 1.2 ml/minute and a temperature of 43°C. PTH-amino acids were detected based on the absorbance at 269 nm.
As a result, the N-terminal amino acid sequence shown below in Table 1 was identified.

Then, the same protein having protease inhibitor activity as purified as in Example 1 and eluted by reversed phase HPLC was dissolved in 100 µl of 50 mM Tris-hydrochloride buffer (pH 9.0) containing 4 M urea, lysyl endopeptidase (Achromobacter protease I) was added to the solution, and the reaction was carried out at 37°C for 8 hours. The resulting peptide mixture was separated by reversed phase HPLC using a YMC pack C8 column (YMC) to give respective peptide fragments. Two peptides were subjected to amino acid sequence analysis using a gaseous phase sequencer (Applied Biosystems model 470A). The sequences shown in Table 1 were found.

TABLE 1

Amino acid sequences of peptides

N-terminal: Ala-Asp-Arg-Glu-Arg-Ser-Ile-His-Asp-Phe-Xaa-Leu-Val-Ser-Lys (SEQ ID NO:1 in the sequence listing)

Partial amino acid sequences

1: Lys-Val-Val-Gly-Arg-Xaa-Arg-Ala-Ser-Met-Pro-Arg-Trp-Trp-Tyr-Asn-Val-Thr-Asp-Gly-Ser-Xaa-Gln-Leu-Phe-Val-Tyr-Gly-Gly (SEQ ID NO:2 in the sequence listing)

2: Ala-Thr-Val-Thr-Glu-Asn-Ala-Thr-Gly-Asp-Leu-Ala-Thr-Ser-Arg-Asn-Ala-Ala-Asp-Ser-Ser-Val-Pro-Ser-Ala-Pro (SEQ ID NO:3 in the sequence listing)

(Xaa: amino acid residue not yet identified)

EXAMPLE 3

(Purification of the protein using an A549 cell culture supernatant and amino acid sequence analysis)

A culture supernatant was prepared by cultivating A549 cells (obtained from the Japanese Cancer Research Resources Bank) in the same manner as in Example 1. Using the culture supernatant and proceeding in the same manner as in Example 1, a protein having the inhibitory activity on the protease activity of HGF activator was obtained. Upon SDS-PAGE, this protein showed the same molecular weight as that derived from MKN45 cells. When subjected to the same N-terminal amino acid sequence determination as in Example 1, this protein gave the same sequence as that of the MKN45 cell-derived protein. This suggested the possibility of the protein being identical with the MKN45-derived protein.

EXAMPLE 4

(Method of assaying the activity of the protein inhibiting the protease activity of HGF activator as well as the activity) 

One to ten µl of the sample to be assayed was added to 30 to 40 µl of PBS-0.05% CHAPS solution containing 2 to 5 ng of serum-derived HGF activator. After 30 minutes of incubation at 37°C, 5 to 10 µg of single chain HGF was added and incubation was further continued for 2 hours. This incubation mixture was subjected to SDS-polyacrylamide gel electrophoresis under reducing conditions. After electrophoresis, Coomassie Brilliant Blue R250 (CBB) staining was performed and the proportions of single chain HGF and double chain HGF were compared for activity detection.

The purified protein (10 ng) and 5 ng of serum-derived HGF activator were incubated in 30 to 40 µl of PBS-0.05% CHAPS solution at 37°C for 30 minutes, then 10 ng of single chain HGF was added, and incubation was further continued for 2 hours. The incubation mixture was subjected to SDS-polyacrylamide gel electrophoresis under reducing conditions followed by staining with CBB. The results are shown in Fig. 1. In the figure, the numeral 1 indicates the case where neither HGF activator nor the protein was added, 2 indicates the case where HGF activator was added but the protein was not added, and 3 indicates the case where HGF activator and the protein were added. Addition of the protein resulted in suppression of the activity of HGF activator converting single chain HGF to double chain HGF.

EXAMPLE 5 

(SDS-polyacrylamide gel electrophoresis)

For determining the apparent molecular weight of the protein having protease inhibitor activity as purified from the MKN45 cell culture supernatant or A549 cell culture supernatant in Example 1 or Example 2, the protein was subjected to SDS-polyacrylamide gel electrophoresis. The protein finally purified was subjected to SDS-polyacrylamide gel electrophoresis using 12.5% polyacrylamide slab gels, which was conducted under nonreducing conditions. The molecular weight markers used were Molecular weight markers "Daiichi" III for the Laemmli method (Daiichi Pure Chemicals). After electrophoresis, color development was performed using a silver stain reagent (Kanto Chemical). Upon relative comparison in migration distance between the protein and the molecular weight markers, the protein obtained from the MKN45 cell culture supernatant or A549 cell culture supernatant showed several fragments or a smear band, presumably due to differences in sugar chain, amino acid residue modification or terminal region, at positions around an apparent molecular weight of about 30,000 daltons as determined by SDS-polyacrylamide gel electrophoresis.
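The relative comparison of migration distances mentioned above is commonly done by fitting a straight line to log10(molecular weight) against relative mobility (Rf) of the markers and reading off the unknown. The Python sketch below illustrates that general technique only; the marker values and the Rf of the sample are hypothetical placeholders, not data from this Example, and the function names are illustrative.

```python
import math


def fit_log_mw(markers):
    """Least-squares fit of log10(MW) = a + b * Rf.

    markers: iterable of (molecular_weight_in_daltons, relative_mobility).
    Returns the intercept a and slope b.
    """
    markers = list(markers)
    n = len(markers)
    if n < 2:
        raise ValueError("at least two markers are required")
    xs = [rf for _, rf in markers]
    ys = [math.log10(mw) for mw, _ in markers]
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("markers must have differing mobilities")
    slope = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


def estimate_mw(rf, markers):
    """Apparent molecular weight (daltons) for a band at mobility rf."""
    intercept, slope = fit_log_mw(markers)
    return 10 ** (intercept + slope * rf)


# Hypothetical marker set (MW in daltons, Rf); NOT values from the Example.
hypothetical_markers = [(97400, 0.12), (66200, 0.25), (45000, 0.40),
                        (31000, 0.58), (21500, 0.75), (14400, 0.90)]
print(round(estimate_mw(0.60, hypothetical_markers)))
```

For a smear or a group of bands, as observed here, the estimate would be applied to the upper and lower edges of the smear to give a range rather than a single value.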

EXAMPLE 6 

(Cloning of a gene coding for the protein and base sequence determination)

The following two oligonucleotide primers were designed, on the presumption that Xaa is Cys, from the sequences Lys-Val-Val-Gly-Arg-Xaa-Arg and Xaa-Gln-Leu-Phe-Val-Tyr-Gly-Gly selected from the partial amino acid sequence (SEQ ID NO:2 in the sequence listing) of the protein obtained in Example 2.


Primer 1 : 5'-AAGGTNGTNGGNMGNTGYMG-3' (SEQ ID NO: 5 in the sequence listing); and 
Primer 2: 5'-CNCCGTANAGGAANARYTGRC-3' (SEQ ID NO: 6 in the sequence listing). 

(In the above sequences, N indicates T or G, M indicates A or C, Y indicates T or C. and R indicates A or G.) 
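
As an illustration of what the degenerate symbols imply, the number of distinct oligonucleotides represented by each primer can be counted from the symbol definitions given above. This is a minimal sketch under stated assumptions: the symbol table follows the text literally (N as T or G, which is narrower than the usual IUPAC meaning of N as any base), Primer 2 is taken as printed in the sequence listing, and the function names are hypothetical.

```python
from itertools import product

# Symbol definitions exactly as stated in the text. Assumption: N is treated
# as "T or G" because the text says so; under IUPAC, N would mean any base.
CODES = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "N": "TG", "M": "AC", "Y": "TC", "R": "AG",
}

PRIMER_1 = "AAGGTNGTNGGNMGNTGYMG"
PRIMER_2 = "CNCCGTANACGAANARYTGRC"  # as printed in the sequence listing


def degeneracy(primer, codes=CODES):
    """Number of distinct sequences represented by a degenerate primer."""
    count = 1
    for symbol in primer:
        count *= len(codes[symbol])
    return count


def expand(primer, codes=CODES):
    """Enumerate every concrete sequence represented by the primer."""
    return ["".join(bases) for bases in product(*(codes[s] for s in primer))]


if __name__ == "__main__":
    for name, primer in (("Primer 1", PRIMER_1), ("Primer 2", PRIMER_2)):
        print(name, len(primer), "nt, degeneracy", degeneracy(primer))
```

Under these definitions Primer 1 has seven two-fold degenerate positions and Primer 2 has six.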

Then, total RNA was prepared from MKN45 cells by the method described in Anal. Biochem., 162, 156 (1987) and
applied to an oligo(dT)-cellulose column, whereby poly(A)+RNA was obtained.

Using the thus-obtained poly(A)+RNA as the template, RT-PCR (reverse transcription-polymerase chain reaction; cf.
K. Hayashi (ed.): "PCR Ho no Saishin Gijutu (State-of-art Techniques of PCR)", page 44 and page 52, published Feb.
5, 1995 by Yohdosha) was carried out. The reaction mixture obtained by this RT-PCR was analyzed by polyacrylamide
gel electrophoresis, whereupon a DNA fragment of about 35 bp was detected. Therefore, this DNA fragment was
extracted from the polyacrylamide gel, followed by phenol-chloroform extraction and ethanol precipitation, whereby the
DNA fragment was recovered. The base sequence of the DNA fragment was determined by the dideoxy method.
Further, this DNA fragment was labeled with ³²P by the method described in "Molecular Cloning" (Cold Spring Harbor
Laboratory, 1982) and used as a screening probe.

Using the MKN45 cell line-derived poly(A)+RNA together with a cDNA synthesis kit (Pharmacia), cDNA was
synthesized, and a phage library was constructed, as a library for screening, with λ ZAPII (Stratagene) as the vector.
Escherichia coli XL Blue (Stratagene) was infected with the above phage library to give about 100,000 plaques.

After overnight culture in NZY medium, the bacterial cells were transferred to a Gene Screening Plus membrane
(Du Pont). The membrane was placed on a filter paper impregnated with 0.1 M sodium hydroxide-0.5 M Tris
hydrochloride buffer (pH 7.5) and allowed to stand for 2 minutes and then placed on a filter paper impregnated with
1.5 M sodium chloride-0.5 M Tris hydrochloride buffer (pH 7.5) and allowed to stand for 5 minutes. After two more
repetitions of this series of treatments, the membrane was washed with 2 x SSC (two-fold concentrated SSC) and
air-dried on a dry filter paper. This membrane was irradiated with UV light at a dose of 120 mJ/cm² for fixation of the
DNAs transferred to the membrane. The thus-treated membrane was immersed in 50 ml of a solution comprising
50 mM Tris hydrochloride buffer (pH 7.5), 1 M sodium chloride and 1% SDS and maintained in that state at 65°C for
2 hours. Then, the membrane was immersed in 40 ml of a solution comprising 5 ng/ml of the above-mentioned
³²P-labeled probe, 100 ng/ml of salmon sperm DNA, 50 mM Tris hydrochloride buffer (pH 7.5), 1 M sodium chloride
and 1% SDS and maintained in that state at 65°C for 16 hours. Thereafter, this membrane was washed with 2 x SSC
at room temperature over 5 minutes and then with two portions of 0.1 x SSC at room temperature over 30 minutes,
and subjected to autoradiography, which gave 22 positive clones supposedly containing cDNA for the protein. Then,
each positive clone was submitted to plasmid construction by the excision method according to Toyobo Life Science
Catalog (pages 114-115) and Stratagene's manual.

The thus-obtained plasmid DNAs were then cleaved with the restriction enzyme EcoRI and subjected to agarose
gel electrophoresis, and a clone with the longest cDNA for the protein inserted therein was selected. Then, by analyzing
the base sequence of the plasmid (pHAI-II) harbored by this clone, the whole base sequence of the gene coding for the
protein was determined (SEQ ID NO:4 in the sequence listing).
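
The feature coordinates given for SEQ ID NO:4 in the sequence listing (coding sequence of 759 nucleotides, signal peptide at nucleotides 1 to 81) can be cross-checked against the residue counts used elsewhere in this specification. The sketch below is an illustration only: it uses the standard genetic code, the first 48 nucleotides as they read in the sequence listing, and hypothetical variable and function names.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string (spaces allowed) codon by codon; '*' marks a stop."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 48 nucleotides of SEQ ID NO:4 as they read in the sequence listing.
FIRST_48_NT = "ATG GCG CAG CTG TGC GGG CTG AGG CGG AGC CGG GCG TTT CTC GCC CTG"

TOTAL_NT = 759       # length of the coding sequence, stop codon included
SIGNAL_END_NT = 81   # last nucleotide of the signal peptide

signal_residues = SIGNAL_END_NT // 3                    # residues in the signal peptide
protein_residues = TOTAL_NT // 3 - 1                    # codons minus the stop codon
mature_residues = protein_residues - signal_residues    # residues in the mature protein

if __name__ == "__main__":
    print(translate(FIRST_48_NT))
    print(signal_residues, protein_residues, mature_residues)
```

These counts agree with the mature protein beginning at the 28th residue and at the 82nd nucleotide, as recited in the claims.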

EXAMPLE 7 


(Preparation of an expression plasmid for the protein) 

A 10-µg portion of the plasmid (pHAI-II) containing the cDNA for the protein as obtained in Example 6 was
subjected to cleavage with the restriction enzyme EcoRI, followed by agarose gel electrophoresis. Thus, an about 1.4 kb
EcoRI-EcoRI DNA fragment containing the cDNA for the protein was separated and extracted. The thus-obtained DNA
fragment was rendered blunt-ended by the conventional method using T4 DNA polymerase and, then, the DNA
fragment was purified by phenol-chloroform extraction and ethanol precipitation and dissolved in 10 µl of water.
Separately, 0.05 µg of the expression vector pME18S (Medical Immunology, 20, 27 (1990)) was cleaved in advance
with the restriction enzyme XhoI, and the DNA fragment obtained was rendered blunt-ended by the conventional
method using T4 DNA polymerase and then purified by phenol-chloroform extraction and ethanol precipitation. This
was dissolved in 400 µl of a 1 mM MgCl2 solution in 50 mM Tris-HCl (pH 8), 1 unit of bacterial alkaline phosphatase
(Toyobo, BAP-101) was added, and dephosphorylation treatment was conducted at 65°C for 30 minutes. Then, the
DNA fragment was purified from this reaction mixture by phenol-chloroform extraction and ethanol precipitation and
dissolved in 10 µl of water. Ligation reaction was carried out in 20 µl of a reaction mixture (66 mM Tris-HCl, pH 7.6,
6.6 mM MgCl2, 10 mM dithiothreitol, 66 µM ATP) containing 0.01 µg of the pME18S vector-derived DNA fragment
prepared as mentioned above and 0.1 µg of the above-mentioned blunt-ended EcoRI fragment of cDNA for the protein
in the presence of T4 DNA ligase (Toyobo LGA-101) at 14°C for 12 hours. A 10-µl portion of this T4 DNA ligase
reaction mixture was used to transform Escherichia coli HB101 (Takara Shuzo) according to the manual attached
thereto. The microorganism was cultured on a medium containing 50 µg/ml of ampicillin and scores of
ampicillin-resistant strains were obtained. These transformants were analyzed by the method of Maniatis et al.
("Molecular Cloning", Cold Spring Harbor Laboratory, pages 86-96 (1982)) and, as a result, a plasmid,
pME18S-HAI-II, containing the gene coding for the protein as inserted at the XhoI restriction enzyme cleavage site
occurring between the promoter and polyadenylation site of the expression vector pME18S could be obtained. Its
structure is shown in Fig. 2.

EXAMPLE 8

(Obtaining of a cell line expressing the protein) 

The plasmid pME18S-HAI-II constructed in Example 7 and containing the cDNA for the protein as inserted at the
XhoI restriction enzyme cleavage site of the expression vector pME18S was recovered and purified from the
recombinant Escherichia coli by the method of Maniatis et al. ("Molecular Cloning", Cold Spring Harbor Laboratory,
pages 86-96 (1982)) and thus a large amount of the expression plasmid DNA for the protein was obtained. COS cells
were transformed by transfection thereof with the expression plasmid DNA. Thus, COS cells were first cultured in
eRDF medium containing 10% FBS (fetal bovine serum) in tissue culture dishes 9 cm in diameter until a semiconfluent
condition. Then, the medium was removed from the dishes, and a DNA solution prepared as mentioned below was
added dropwise thereto. First, for each dish 9 cm in diameter, a solution was prepared in an Eppendorf centrifuge
tube by adding thereto 300 µl of 2 x HEBS solution (2 x HEBS solution: 1.6% sodium chloride, 0.074% potassium
chloride, 0.05% disodium hydrogen phosphate dodecahydrate, 0.2% dextrose, 1% HEPES (pH 7.05)) and 10 µg of the
plasmid DNA and making the volume 570 µl with sterilized water. Then, while adding 30 µl of 2.5 M calcium chloride
solution to the DNA solution, the tube contents were stirred vigorously using a vortex mixer for several seconds. The
resulting mixture was allowed to stand at room temperature for 30 minutes, with occasional stirring at intervals of about
10 minutes using a vortex mixer. The thus-prepared DNA solution was laid on the cells mentioned above and the whole
was allowed to stand at room temperature for 30 minutes. Then, 9 ml of eRDF medium (Kyokuto Pharmaceutical)
supplemented with 10% FBS was added to each dish and incubation was performed at 37°C for 4 to 5 hours in the
presence of 5% CO2. Then, the medium was removed from each dish and the cells were washed with 5 ml of 1 x TBS++
solution (1 x TBS++ solution: 25 mM Tris-hydrochloride (pH 7.5), 140 mM sodium chloride, 5 mM potassium chloride,
0.6 mM disodium hydrogen phosphate, 0.08 mM calcium chloride, 0.08 mM magnesium chloride). After removing the
1 x TBS++ solution, the cells were covered with 5 ml/dish of 1 x HEBS solution containing 10% DMSO (dimethyl
sulfoxide) and allowed to stand at room temperature for 1 to 2 minutes. The supernatant was then removed. The cells
were again washed with 5 ml of 1 x TBS++ solution, 10 ml of eRDF medium supplemented with 10% FBS was added
to each dish and incubation was performed at 37°C in the presence of 5% CO2. After the lapse of 48 hours, the medium
was recovered. The supernatant recovered was 20-fold concentrated and assayed for inhibitory activity against HGF
activator in the same manner as in Example 4. The inhibitory activity was confirmed.
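
The final composition of the calcium phosphate-DNA mixture prepared above follows from simple dilution arithmetic on the stated volumes. The following is a minimal sketch of that calculation; the function and variable names are hypothetical, and only the volumes and stock concentrations stated in this Example are used.

```python
def final_concentration(stock, stock_volume_ul, total_volume_ul):
    """Concentration after diluting `stock_volume_ul` of `stock` to `total_volume_ul`."""
    return stock * stock_volume_ul / total_volume_ul


# Volumes and stocks as stated in this Example.
DNA_MIX_UL = 570.0       # 2 x HEBS + plasmid DNA, made up with sterilized water
CACL2_UL = 30.0          # 2.5 M calcium chloride added to the DNA solution
TOTAL_UL = DNA_MIX_UL + CACL2_UL

# Calcium chloride: 2.5 M stock expressed in mM.
cacl2_final_mM = final_concentration(2500.0, CACL2_UL, TOTAL_UL)

# HEBS: 300 µl of the 2 x solution, expressed as a fold concentration.
hebs_final_fold = final_concentration(2.0, 300.0, TOTAL_UL)

# Plasmid DNA: 10 µg in the total volume, expressed in µg/ml.
dna_final_ug_per_ml = 10.0 / (TOTAL_UL / 1000.0)

if __name__ == "__main__":
    print(TOTAL_UL, cacl2_final_mM, hebs_final_fold, round(dna_final_ug_per_ml, 1))
```

On these figures the mixture laid on each dish is 600 µl of 1 x HEBS containing 125 mM calcium chloride.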

While the invention has been described in detail and with reference to specific embodiments thereof, it will be 
apparent to one skilled in the art that various changes and modifications can be made therein without departing from 
the spirit and scope thereof. 
SEQUENCE LISTING
INFORMATION FOR SEQ ID NO: 1

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 15 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(v) FRAGMENT TYPE: N-terminal fragment

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Homo sapiens

(B) STRAIN: MKN45

(x) SEQUENCE DESCRIPTION: SEQ ID NO: 1
Ala Asp Arg Glu Arg Ser Ile His Asp Phe Xaa Leu Val Ser Lys
1 5 10 15

Xaa: unknown amino acid
INFORMATION FOR SEQ ID NO: 2

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(v) FRAGMENT TYPE: internal fragment

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Homo sapiens

(B) STRAIN: MKN45

(x) SEQUENCE DESCRIPTION: SEQ ID NO: 2
Lys Val Val Gly Arg Xaa Arg Ala Ser Met Pro Arg Trp Trp Tyr Asn
1 5 10 15

Val Thr Asp Gly Ser Xaa Gln Leu Phe Val Tyr Gly Gly
20 25

Xaa: unknown amino acid
INFORMATION FOR SEQ ID NO: 3

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(v) FRAGMENT TYPE: internal fragment

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Homo sapiens

(B) STRAIN: MKN45

(x) SEQUENCE DESCRIPTION: SEQ ID NO: 3

Ala Thr Val Thr Glu Asn Ala Thr Gly Asp Leu Ala Thr Ser Arg Asn
1 5 10 15

Ala Ala Asp Ser Ser Val Pro Ser Ala Pro
20 25
INFORMATION FOR SEQ ID NO: 4

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 759 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA to mRNA

(iv) ANTISENSE: no

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Homo sapiens

(B) STRAIN: MKN45

(ix) FEATURES:

(A) NAME/KEY: coding sequence

(B) LOCATION: 1 to 759

(C) IDENTIFICATION METHOD: by experiment

(A) NAME/KEY: signal peptide

(B) LOCATION: 1 to 81

(C) IDENTIFICATION METHOD: by experiment

(A) NAME/KEY: mature peptide

(B) LOCATION: 82 to 759

(C) IDENTIFICATION METHOD: by experiment

(x) SEQUENCE DESCRIPTION: SEQ ID NO: 4

ATG GCG CAG CTG TGC GGG CTG AGG CGG AGC CGG GCG TTT CTC GCC CTG 48
Met Ala Gln Leu Cys Gly Leu Arg Arg Ser Arg Ala Phe Leu Ala Leu
1 5 10 15

CTG GGA TCG CTG CTC CTC TCT GGG GTC CTG GCG GCC GAC CGA GAA CGC 96
Leu Gly Ser Leu Leu Leu Ser Gly Val Leu Ala Ala Asp Arg Glu Arg
20 25 30

AGC ATC CAC GAC TTC TGC CTG GTG TCG AAG GTG GTG GGC AGA TGC CGG 144
Ser Ile His Asp Phe Cys Leu Val Ser Lys Val Val Gly Arg Cys Arg
35 40 45

GCC TCC ATG CCT AGG TGG TGG TAC AAT GTC ACT GAC GGA TCC TGC CAG 192
Ala Ser Met Pro Arg Trp Trp Tyr Asn Val Thr Asp Gly Ser Cys Gln
50 55 60

CTG TTT GTG TAT GGG GGC TGT GAC GGA AAC AGC AAT AAT TAC CTG ACC 240
Leu Phe Val Tyr Gly Gly Cys Asp Gly Asn Ser Asn Asn Tyr Leu Thr
65 70 75 80

AAG GAG GAG TGC CTC AAG AAA TGT GCC ACT GTC ACA GAG AAT GCC ACG 288
Lys Glu Glu Cys Leu Lys Lys Cys Ala Thr Val Thr Glu Asn Ala Thr
85 90 95

GGT GAC CTG GCC ACC AGC AGG AAT GCA GCG GAT TCC TCT GTC CCA AGT 336
Gly Asp Leu Ala Thr Ser Arg Asn Ala Ala Asp Ser Ser Val Pro Ser
100 105 110

GCT CCC AGA AGG CAG GAT TCT GAA GAC CAC TCC AGC GAT ATG TTC AAC 384
Ala Pro Arg Arg Gln Asp Ser Glu Asp His Ser Ser Asp Met Phe Asn
115 120 125

TAT GAA GAA TAC TGC ACC GCC AAC GCA GTC ACT GGG CCT TGC CGT GCA 432
Tyr Glu Glu Tyr Cys Thr Ala Asn Ala Val Thr Gly Pro Cys Arg Ala
130 135 140

TCC TTC CCA CGC TGG TAC TTT GAC GTG GAG AGG AAC TCC TGC AAT AAC 480
Ser Phe Pro Arg Trp Tyr Phe Asp Val Glu Arg Asn Ser Cys Asn Asn
145 150 155 160

TTC ATC TAT GGA GGC TGC CGG GGC AAT AAG AAC AGC TAC CGC TCT GAG 528
Phe Ile Tyr Gly Gly Cys Arg Gly Asn Lys Asn Ser Tyr Arg Ser Glu
165 170 175

GAG GCC TGC ATG CTC CGC TGC TTC CGC CAG CAG GAG AAT CCT CCC CTG 576
Glu Ala Cys Met Leu Arg Cys Phe Arg Gln Gln Glu Asn Pro Pro Leu
180 185 190

CCC CTT GGC TCA AAG GTG GTG GTT CTG GCG GGG CTG TTC GTG ATG GTG 624
Pro Leu Gly Ser Lys Val Val Val Leu Ala Gly Leu Phe Val Met Val
195 200 205

TTG ATC CTC TTC CTG GGA GCC TCC ATG GTC TAC CTG ATC CGG GTG GCA 672
Leu Ile Leu Phe Leu Gly Ala Ser Met Val Tyr Leu Ile Arg Val Ala
210 215 220

CGG AGG AAC CAG GAG CGT GCC CTG CGC ACC GTC TGG AGC TCC GGA GAT 720
Arg Arg Asn Gln Glu Arg Ala Leu Arg Thr Val Trp Ser Ser Gly Asp
225 230 235 240

GAC AAG GAG CAG CTG GTG AAG AAC ACA TAT GTC CTG TGA 759
Asp Lys Glu Gln Leu Val Lys Asn Thr Tyr Val Leu *
245 250

INFORMATION FOR SEQ ID NO: 5

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 nucleotides

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single stranded
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid, synthetic DNA
(x) SEQUENCE DESCRIPTION: SEQ ID NO: 5

AAGGTNGTNG GNMGNTGYMG

N: T or G, M: A or C, Y: T or C, and R: A or G

INFORMATION FOR SEQ ID NO: 6

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 nucleotides

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single stranded

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid, synthetic DNA
(x) SEQUENCE DESCRIPTION: SEQ ID NO: 6

CNCCGTANAC GAANARYTGR C

N: T or G, M: A or C, Y: T or C, and R: A or G
SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT:

(A) NAME: MITSUBISHI CHEMICAL CORPORATION

(B) STREET: 5-2, Marunouchi 2-chome, Chiyoda-ku

(C) CITY: Tokyo

(E) COUNTRY: Japan

(F) POSTAL CODE (ZIP): none

(ii) TITLE OF INVENTION: NOVEL PROTEIN, DNA CODING FOR SAME
AND METHOD OF PRODUCING THE PROTEIN

(iii) NUMBER OF SEQUENCES: 7

(iv) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk

(B) COMPUTER: IBM PC compatible

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO)

(v) CURRENT APPLICATION DATA:

APPLICATION NUMBER: EP 96 111 861.9

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(v) FRAGMENT TYPE: N-terminal

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(B) STRAIN: MKN45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

Ala Asp Arg Glu Arg Ser Ile His Asp Phe Xaa Leu Val Ser Lys
1 5 10 15



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide

(v) FRAGMENT TYPE: internal

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(B) STRAIN: MKN45

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Lys Val Val Gly Arg Xaa Arg Ala Ser Met Pro Arg Trp Trp Tyr Asn 
1 5 10 15 

Val Thr Asp Gly Ser Xaa Gln Leu Phe Val Tyr Gly Gly
20 25 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide
(v) FRAGMENT TYPE: internal

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(B) STRAIN: MKN45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Ala Thr Val Thr Glu Asn Ala Thr Gly Asp Leu Ala Thr Ser Arg Asn 
1 5 10 15 

Ala Ala Asp Ser Ser Val Pro Ser Ala Pro 
20 25 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 759 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(B) STRAIN: MKN45 
(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..756

(C) IDENTIFICATION METHOD: experimental

(D) OTHER INFORMATION: /evidence= EXPERIMENTAL

(ix) FEATURE:

(A) NAME/KEY: sig_peptide

(B) LOCATION: 1..81

(C) IDENTIFICATION METHOD: experimental

(D) OTHER INFORMATION: /evidence= EXPERIMENTAL

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 82..756

(C) IDENTIFICATION METHOD: experimental

(D) OTHER INFORMATION: /evidence= EXPERIMENTAL



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

ATG GCG CAG CTG TGC GGG CTG AGG CGG AGC CGG GCG TTT CTC GCC CTG 48
Met Ala Gln Leu Cys Gly Leu Arg Arg Ser Arg Ala Phe Leu Ala Leu
-27 -25 -20 -15

CTG GGA TCG CTG CTC CTC TCT GGG GTC CTG GCG GCC GAC CGA GAA CGC 96
Leu Gly Ser Leu Leu Leu Ser Gly Val Leu Ala Ala Asp Arg Glu Arg
-10 -5 1 5

AGC ATC CAC GAC TTC TGC CTG GTG TCG AAG GTG GTG GGC AGA TGC CGG 144
Ser Ile His Asp Phe Cys Leu Val Ser Lys Val Val Gly Arg Cys Arg
10 15 20

GCC TCC ATG CCT AGG TGG TGG TAC AAT GTC ACT GAC GGA TCC TGC CAG 192
Ala Ser Met Pro Arg Trp Trp Tyr Asn Val Thr Asp Gly Ser Cys Gln
25 30 35

CTG TTT GTG TAT GGG GGC TGT GAC GGA AAC AGC AAT AAT TAC CTG ACC 240
Leu Phe Val Tyr Gly Gly Cys Asp Gly Asn Ser Asn Asn Tyr Leu Thr
40 45 50

AAG GAG GAG TGC CTC AAG AAA TGT GCC ACT GTC ACA GAG AAT GCC ACG 288
Lys Glu Glu Cys Leu Lys Lys Cys Ala Thr Val Thr Glu Asn Ala Thr
55 60 65

GGT GAC CTG GCC ACC AGC AGG AAT GCA GCG GAT TCC TCT GTC CCA AGT 336
Gly Asp Leu Ala Thr Ser Arg Asn Ala Ala Asp Ser Ser Val Pro Ser
70 75 80 85

GCT CCC AGA AGG CAG GAT TCT GAA GAC CAC TCC AGC GAT ATG TTC AAC 384
Ala Pro Arg Arg Gln Asp Ser Glu Asp His Ser Ser Asp Met Phe Asn
90 95 100

TAT GAA GAA TAC TGC ACC GCC AAC GCA GTC ACT GGG CCT TGC CGT GCA 432
Tyr Glu Glu Tyr Cys Thr Ala Asn Ala Val Thr Gly Pro Cys Arg Ala
105 110 115

TCC TTC CCA CGC TGG TAC TTT GAC GTG GAG AGG AAC TCC TGC AAT AAC 480
Ser Phe Pro Arg Trp Tyr Phe Asp Val Glu Arg Asn Ser Cys Asn Asn
120 125 130

TTC ATC TAT GGA GGC TGC CGG GGC AAT AAG AAC AGC TAC CGC TCT GAG 528
Phe Ile Tyr Gly Gly Cys Arg Gly Asn Lys Asn Ser Tyr Arg Ser Glu
135 140 145

GAG GCC TGC ATG CTC CGC TGC TTC CGC CAG CAG GAG AAT CCT CCC CTG 576
Glu Ala Cys Met Leu Arg Cys Phe Arg Gln Gln Glu Asn Pro Pro Leu
150 155 160 165

CCC CTT GGC TCA AAG GTG GTG GTT CTG GCG GGG CTG TTC GTG ATG GTG 624
Pro Leu Gly Ser Lys Val Val Val Leu Ala Gly Leu Phe Val Met Val
170 175 180

TTG ATC CTC TTC CTG GGA GCC TCC ATG GTC TAC CTG ATC CGG GTG GCA 672
Leu Ile Leu Phe Leu Gly Ala Ser Met Val Tyr Leu Ile Arg Val Ala
185 190 195

CGG AGG AAC CAG GAG CGT GCC CTG CGC ACC GTC TGG AGC TCC GGA GAT 720
Arg Arg Asn Gln Glu Arg Ala Leu Arg Thr Val Trp Ser Ser Gly Asp
200 205 210

GAC AAG GAG CAG CTG GTG AAG AAC ACA TAT GTC CTG TGA 759
Asp Lys Glu Gln Leu Val Lys Asn Thr Tyr Val Leu
215 220 225



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 252 amino acids 
(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

Met Ala Gln Leu Cys Gly Leu Arg Arg Ser Arg Ala Phe Leu Ala Leu
-27 -25 -20 -15

Leu Gly Ser Leu Leu Leu Ser Gly Val Leu Ala Ala Asp Arg Glu Arg
-10 -5 1 5

Ser Ile His Asp Phe Cys Leu Val Ser Lys Val Val Gly Arg Cys Arg
10 15 20

Ala Ser Met Pro Arg Trp Trp Tyr Asn Val Thr Asp Gly Ser Cys Gln
25 30 35

Leu Phe Val Tyr Gly Gly Cys Asp Gly Asn Ser Asn Asn Tyr Leu Thr
40 45 50

Lys Glu Glu Cys Leu Lys Lys Cys Ala Thr Val Thr Glu Asn Ala Thr
55 60 65

Gly Asp Leu Ala Thr Ser Arg Asn Ala Ala Asp Ser Ser Val Pro Ser
70 75 80 85

Ala Pro Arg Arg Gln Asp Ser Glu Asp His Ser Ser Asp Met Phe Asn
90 95 100

Tyr Glu Glu Tyr Cys Thr Ala Asn Ala Val Thr Gly Pro Cys Arg Ala
105 110 115

Ser Phe Pro Arg Trp Tyr Phe Asp Val Glu Arg Asn Ser Cys Asn Asn
120 125 130

Phe Ile Tyr Gly Gly Cys Arg Gly Asn Lys Asn Ser Tyr Arg Ser Glu
135 140 145

Glu Ala Cys Met Leu Arg Cys Phe Arg Gln Gln Glu Asn Pro Pro Leu
150 155 160 165

Pro Leu Gly Ser Lys Val Val Val Leu Ala Gly Leu Phe Val Met Val
170 175 180

Leu Ile Leu Phe Leu Gly Ala Ser Met Val Tyr Leu Ile Arg Val Ala
185 190 195

Arg Arg Asn Gln Glu Arg Ala Leu Arg Thr Val Trp Ser Ser Gly Asp
200 205 210

Asp Lys Glu Gln Leu Val Lys Asn Thr Tyr Val Leu
215 220 225

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "synthetic DNA"

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: group(6, 9, 12, 15)

(D) OTHER INFORMATION: /note= "N is T or G"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:
AAGGTNGTNG GNMGNTGYMG 20
(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "synthetic DNA"

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: group(2, 8, 14)

(D) OTHER INFORMATION: /note= "N is T or G"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:
CNCCGTANAC GAANARYTGR C 21



Claims 

1. A protein having the following physico-chemical properties:

(1) a molecular weight of about 30,000 daltons as determined by SDS-polyacrylamide gel electrophoresis;

(2) inhibitory activity on the protease activity of hepatocyte growth factor activator; and

(3) one of the amino acid sequences depicted in the sequence listing under SEQ ID NO:1 through 3 or an
amino acid sequence substantially equivalent thereto.

2. A protein having one of the amino acid sequences depicted in the sequence listing under SEQ ID NO:1 through 3
or an amino acid sequence substantially equivalent thereto and having inhibitory activity on the protease activity of
hepatocyte growth factor activator.

3. A protein having the amino acid sequence depicted in the sequence listing under SEQ ID NO:4 or an amino acid
sequence substantially equivalent thereto.

4. A protein having, as its amino acid sequence, that segment of the amino acid sequence depicted in the sequence
listing under SEQ ID NO:4 which starts with the 28th amino acid (alanine) residue and ends with the 252nd amino
acid (leucine) residue, or an amino acid sequence substantially equivalent thereto.

5. A DNA coding for the protein of Claim 1.

6. A DNA coding for the protein of Claim 2.

7. A DNA coding for the protein of Claim 3.

8. A DNA as claimed in Claim 7 which is represented by the base sequence depicted in the sequence listing under
SEQ ID NO:4 or a base sequence substantially equivalent thereto.

9. A DNA coding for the protein of Claim 4.

10. A DNA as claimed in Claim 9 which is represented by that segment of the base sequence depicted in the sequence
listing under SEQ ID NO:4 which starts with the 82nd nucleotide (guanine) and ends with the 759th nucleotide
(adenine), or a base sequence substantially equivalent thereto.

11. A gene coding for the protein of Claim 1.

12. A gene coding for the protein of Claim 2.

13. A gene coding for the protein of Claim 3.

14. A gene as claimed in Claim 13 which is represented by the base sequence depicted in the sequence listing under
SEQ ID NO:4 or a base sequence substantially equivalent thereto.

15. A gene coding for the protein of Claim 4.

16. A gene as claimed in Claim 15 which is represented by that segment of the base sequence depicted in the
sequence listing under SEQ ID NO:4 which starts with the 82nd nucleotide (guanine) and ends with the 759th
nucleotide (adenine), or a base sequence substantially equivalent thereto.

17. An expression vector which comprises the DNA or gene of one of Claims 5 through 16.

18. A transformant as obtained by transformation of a host cell with the expression vector of Claim 17.

19. A transformant as claimed in Claim 18, wherein the host cell is an animal cell.

20. A method of producing a protein having inhibitory activity on the protease activity of hepatocyte growth factor
activator which comprises cultivating the transformant of Claim 18 or 19 to thereby produce the protein having
inhibitory activity on the protease activity of hepatocyte growth factor activator.

21. A method of protein production as claimed in Claim 20, wherein the protein is a protein defined in one of Claims 1
through 4.

Fig. 1

Fig. 2